A Faster Algorithm for RNA Co-folding
Abstract
The current pairwise RNA (secondary) structural alignment algorithms are based on Sankoff’s dynamic programming algorithm from 1985. Sankoff’s algorithm requires O(N^6) time and O(N^4) space, where N denotes the length of the compared sequences, and thus its applicability is very limited. The current literature offers many heuristics for speeding up Sankoff’s alignment process, some of which make restrictive assumptions on the length or the shape of the RNA substructures. We show how to speed up Sankoff’s algorithm in practice via non-heuristic methods, without compromising optimality. Our analysis shows that the expected time complexity of the new algorithm is O(N^4 ζ(N)), where ζ(N) converges to O(N), assuming a standard polymer folding model that is supported by experimental analysis. Hence our algorithm speeds up Sankoff’s algorithm by a linear factor on average. In simulations, our algorithm speeds up computation by a factor of 3-12 for sequences of length 25-250. Availability: Code and data sets are available upon request.
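The "linear factor" claim can be checked with a short worked ratio. This is only a sketch of the arithmetic implied by the stated bounds, treating the asymptotic bounds as if they were tight; the symbols T_Sankoff and T_new are labels introduced here for illustration and do not come from the paper.

```latex
% T_Sankoff and T_new are illustrative labels (assumed notation).
\[
  T_{\mathrm{Sankoff}}(N) = O(N^{6}), \qquad
  T_{\mathrm{new}}(N) = O\!\left(N^{4}\,\zeta(N)\right)
\]
% Ratio of the two bounds, assuming both are tight:
\[
  \frac{N^{6}}{N^{4}\,\zeta(N)} = \frac{N^{2}}{\zeta(N)}
\]
% With \zeta(N) growing linearly in N, the ratio grows linearly in N:
\[
  \zeta(N) = \Theta(N) \;\Longrightarrow\; \frac{N^{2}}{\zeta(N)} = \Theta(N)
\]
```

Under these assumptions the expected running time drops from sixth order to fifth order in N, which matches the stated average speedup by a linear factor.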
Similar resources
Role of counterion condensation in folding of the Tetrahymena ribozyme. II. Counterion-dependence of folding kinetics.
Condensed counterions contribute to the stability of compact structures in RNA, largely by reducing electrostatic repulsion among phosphate groups. Varieties of cations induce a collapsed state in the Tetrahymena ribozyme that is readily transformed to the catalytically active structure in the presence of Mg2+. Native gel electrophoresis was used to compare the effects of the valence and size o...
RNA Secondary Structure Prediction: The Co-transcriptional effect on RNA folding
RNA secondary structure prediction is an area of bioinformatics that is the subject of significant amounts of research; it is the process of turning a sequence of bases on an RNA strand, for example, AUGUAGCAUC into a map of the interactions between the base pairs. This report looks at different methods of approaching this problem using dynamic programming algorithms. Nussinov’s algorithm was u...
Encoding folding paths of RNA switches
RNA co-transcriptional folding has long been suspected to play an active role in helping proper native folding of ribozymes and structured regulatory motifs in mRNA untranslated regions (UTRs). Yet, the underlying mechanisms and coding requirements for efficient co-transcriptional folding remain unclear. Traditional approaches have intrinsic limitations to dissect RNA folding paths, as they rel...
Nondenaturing Purification of Co-Transcriptionally Folded RNA Avoids Common Folding Heterogeneity
Due to the energetic frustration of RNA folding, tertiary structured RNA is typically characterized by a rugged folding free energy landscape where deep kinetic barriers separate numerous misfolded states from one or more native states. While most in vitro studies of RNA rely on (re)folding chemically and/or enzymatically synthesized RNA in its entirety, which frequently leads into kinetic trap...
An Optimized Parsing Algorithm Well Suited to RNA Folding
The application of stochastic context-free grammars to the determination of RNA foldings allows a simple description of the sub-class of sought secondary structures, but it needs efficient parsing algorithms. The more classic thermodynamic model of folding, popularized by Zuker under the framework of dynamic programming algorithms, allows an easy computation of foldings but its use is delicate ...
Publication date: 2008